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Abstract 

This paper studies the training design problem for multiple-input single-output (MISO) wireless energy transfer 
(WET) systems in frequency-selective channels, where the frequency-diversity and energy-beamforming gains can 
be both reaped to maximize the transferred energy by efficiently learning the channel state information (CSI) at the 
energy transmitter (ET). By exploiting channel reciprocity, a new two-phase channel training scheme is proposed to 
achieve the diversity and beamforming gains, respectively. In the first phase, pilot signals are sent from the energy 
receiver (ER) over a selected subset of the available frequency sub-bands, through which the ET determines a certain 
number of “strongest” sub-bands with largest antenna sum-power gains and sends their indices to the ER. In the 
second phase, the selected sub-bands are further trained by the ER, so that the ET obtains a refined estimate of the 
corresponding MISO channels to implement energy beamforming for WET. A training design problem is formulated 
and optimally solved, which takes into account the channel training overhead by maximizing the net harvested energy 
at the ER, defined as the average harvested energy offset by that consumed in the two-phase training. Moreover, 
asymptotic analysis is obtained for systems with a large number of antennas or a large number of sub-bands to gain 
useful insights on the optimal training design. Finally, numerical results are provided to corroborate our analysis 
and show the effectiveness of the proposed scheme that optimally balances the diversity and beamforming gains 
achieved in MISO WET systems with limited-energy training. 

Index Terms 

Wireless energy transfer (WET), energy beamforming, channel training, RF energy harvesting, frequency diver¬ 
sity. 


I. Introduction 

Wireless energy transfer (WET), by which energy is delivered wirelessly from one location to another without the 
hassle of connecting cables, has emerged as a promising solution to provide convenient and reliable power supplies 
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for energy-constrained wireless networks, such as radio-frequency identification (RFID) and sensor networks |TJ, 
0. One enabling technique of WET for long-range applications (say up to tens of meters) is via radio-frequency 
(RF) or microwave propagation, where dedicated energy-bearing signals arc sent from the energy transmitter (ET) 
over a particular frequency band to the energy receiver (ER) for harvesting the received RF energy (see e.g. |lj, 
[2j and the references therein). 

By exploiting the fact that RF signal is able to convey both information and energy, extensive studies have been 
recently devoted to unifying both information and energy transfer under the framework so-called simultaneous 
wireless information and power transfer (SWIPT) |3j]-|5J. The advancement of WET techniques has also opened 
several interesting applications for SWIPT and other wireless-powered communication networks (WPCN) [JTJ, (6}, 
such as cognitive radio networks [7j, |8 ] and relay networks |9|-fl2{. The key practical challenge for implementing 
WET is to improve its energy transfer efficiency, which is mainly limited by the significant power attenuation loss 
over distance. In line-of-sight (LOS) environment, the efficiency can be enhanced via uni-directional transmission, 
by designing the antenna radiation patterns to form a single sharp beam towards the ER for energy focusing. 
Recently, multi-antenna based technique, termed energy beamfowning, has been proposed for WET |3|, where 
multiple antennas are employed at the ET to more efficiently and flexibly direct wireless energy to one or more 
ERs via digital beamforming, especially in non-LOS environment with multiple signal paths from the ET to ERs. 
Similar to the emerging massive multiple-input multiple-output (MIMO) enabled wireless communications (see 
e.g. (T3), 0 and the references therein), by equipping a large number of antennas at the ET, enormous energy 
beamforming gain can be achieved; hence, the end-to-end energy transfer efficiency can be dramatically enhanced 


On the other hand, for WET in a wide-band regime over frequency-selective channels, the frequency-diversity 
gain can also be exploited to further improve the energy transfer efficiency, by transmitting more power over the 
sub-band with higher channel gain. Ideally, maximum energy transfer efficiency is attained by sending just one 
sinusoid at the frequency that exhibits the strongest channel frequency response [16j. In practice, it may be necessary 
to spread the transmission power over multiple strong sub-bands in order to comply with various regulations, such 
as the power spectral density constraint imposed on the license-free ISM (industrial, scientific and medical) band 
[T7j. WET in single- [16], [T8j, 1191 and multi-antenna [20] frequency-selective channels have been studied under 
the more general SWIPT setup, where perfect channel state information (CSI) is assumed at the transmitter. 

In practice, both the energy-beamforming and frequency-diversity gains can be exploited for multi-antenna WET 
in frequency-selective channels; while they crucially depend on the available CSI at the ET, which needs to be 
practically obtained at the cost of additional time and energy consumed. Similar to wireless communication, a 


direct approach to obtain CSI is by sending pilot signals from the ET to the ER [211, [22|, which estimates the 


corresponding channel and then sends the estimation back to the ET via a finite-rate feedback link (see e.g. [231 


and the references therein). However, since the training overhead increases with the number of antennas M at the 
ET, this method is not suitable when M is large (24J. Furthermore, as pointed out in 1251, estimating the channel 
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at the ER requires complex baseband signal processing, which may not be available at the ER due to its practical 
hardware limitation. A new channel-learning design to cater for the practical RF energy harvesting circuitry at the 
ER has thus been proposed in ||25|. However, the training overhead still increases quickly with M, and can be 


prohibitive as M becomes large. In [261, by exploiting channel reciprocity between the forward (from the ET to 
the ER) and reverse (from the ER to the ET) links, we have proposed an alternative channel-learning scheme for 
WET based on reverse-link training by sending pilot signals from the ER to the ET, which is more efficient since 


the training overhead is independent of M. Moreover, it is revealed in [26| that a non-trivial trade-off exists in 
WET systems between maximizing the energy beamforming gain by more accurately estimating the channel at the 
ET and minimizing the training overhead, including the time and energy used for sending the pilot signals from 


the ER. To optimally resolve the above trade-off, a training design framework has been proposed in [26] based 
on maximizing the net harvested energy at the ER, which is the average harvested energy offset by that used for 
channel training. However, the proposed design in |26j applies only for narrowband flat-fading channels instead of 
the more complex wide-band frequency-selective fading channels, which motivates the current work. 

In this paper, we study the channel learning problem for multiple-input single-output (MISO) point-to-point 
WET systems over frequency-selective fading channels. Compared to its narrowband counterpart studied in p6| [, 
the channel frequency selectivity in MISO wide-band WET system provides an additional frequency-diversity gain 


for further improvement of the energy transfer efficiency [271. However, the training design also becomes more 
challenging, since the channels in both the frequency and spatial domains need to be estimated. In particular, 
achieving the frequency-diversity gain requires only the knowledge of the total channel power from all the M 
transmit antennas at the ET, but for a potentially large number of sub-bands; while the energy beamforming gain 
relies on a more accurate estimation of the MISO channel (both phase and magnitude for each antenna), but 
only for a small subset of sub-bands selected with largest channel power for energy transmission. Therefore, an 
efficient training design for multi-antenna wide-band WET needs to achieve a good balance between exploiting 
the frequency-diversity gain (by training more independent sub-bands) versus the energy beamforming gain (by 
allocating more training energy to the selected sub-bands), and yet without consuming excessive energy at the ER, 
as will be pursued in this paper. Specifically, the main contributions of this paper are summarized as follows: 

• First, under channel reciprocity, we propose a new two-phase channel training scheme for MISO wide-band 
WET, to exploit the frequency-diversity and energy beamforming gains, respectively. In phase-I training, pilot 
signals are sent from the ER over a selected subset of the available frequency sub-bands. Based on the received 
total energy across all the antennas at the ET over each of the trained sub-bands, the ET determines a certain 
number of sub-bands with largest antenna sum-power gains and sends their ordered indices to the ER. In phase- 
II training, the selected sub-bands are further trained by the ER, so that the ET obtains a refined estimate of 
their MISO channels to implement energy beamforming over them for WET. The proposed two-phase training 
protocol is more effective than the single-phase training schemes with phase-I or phase-II training only, or the 
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brute-force training scheme by estimating the (exact) MISO channels at all the available sub-bands. 

• Second, based on the order statistic analysis [j28|], a closed-form expression is derived for the average harvested 
energy at the ER with the proposed two-phase training scheme, in terms of the number of sub-bands trained, as 
well as the pilot signal energy allocated in the two training phases. An optimization problem is thus formulated 
to maximize the net harvested energy at the ER, which explicitly takes into account the energy consumption at 
the ER for sending the pilot signals. The formulated problem is optimally solved based on convex optimization 
techniques to obtain the optimal training design. 

• Third, we provide the asymptotic analysis for systems with a large number of antennas (massive-array WET) or 
a large number of independent sub-bands (massive-band WET) to gain further insights on the optimal training 
design. It is shown that while the net harvested energy at the ER linearly increases with the number of ET 
antennas asymptotically, it is upper-bounded by a constant value as the number of independent sub-bands goes 
to infinity. The latter result is a consequence of the faster increase in training energy cost (linearly) than that in 
the achieved frequency-diversity gain for WET (logarithmically) as more independent sub-bands are trained. 

The rest of this paper is organized as follows. Section [II] introduces the system model. Section III discusses the 
proposed two-phase training scheme for MISO wide-band WET and formulates the training optimization problem. 
In Section [TV] the formulated problem is optimally solved to obtain the optimal training strategy. In Section |V| 
asymptotic analysis is provided for massive-array and massive-band WET systems, respectively. Numerical results 
are provided in Section VI to corroborate our study. Finally, we conclude the paper in Section |VII 


Notations: In this paper, scalars are denoted by italic letters. Boldface lower- and upper-case letters denote vectors 
and matrices, respectively. C MxN denotes the space of M x N complex matrices. E[-] denotes the expectation 
operator. I m denotes an M-dimensional identity matrix and 0 denotes an all-zero matrix of appropriate dimension. 
For an arbitrary-size matrix A, its complex conjugate, transpose, and Hermitian transpose are respectively denoted 
as A*, A 7 and A 11 . ||x|| denotes the Euclidean norm of vector x. For a real number x, [x] + = max{x, ()}. 
a! denotes the factorial of a and represents the binomial coefficient. Furthermore, CAf(ji. C) represents the 
circularly symmetric complex Gaussian (CSCG) distribution with mean // and covariance matrix C. Finally, for 
two positive-valued functions f(x) and g(x), the notation f(x) = Q(g(x)) implies that 3c\ , C 2 > 0 such that 
cig(x) < f(x) < C 2 g(x) for sufficiently large x. 


II. System Model 

A. MISO Multi-Band WET 

We consider a MISO point-to-point WET system in frequency-selective channel, where an ET equipped with 
M > 1 antennas is employed to deliver wireless energy to a single-antenna ER. We assume that the total available 
bandwidth is B, which is equally divided into N orthogonal sub-bands with the nth sub-band centered at frequency 
f n and of bandwidth B s = B/N. We assume that B s < B c , where B c denotes the channel coherence bandwidth, 
so that the channel between the ET and ER experiences frequency-flat fading within each sub-band. Denote h„ e 
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Fig. 1: Schematics of the multi-antenna multi-band wireless energy transmitter. 

C Mxl , n = 1, • • • , N, as the complex-conjugated baseband equivalent MISO channel from the ET to the ER 
in the nth sub-band. We assume a quasi-static Rayleigh fading model, where h„ remains constant within each 
block of T < T c seconds, with T c denoting the channel coherence time, but can vary from one block to another. 
Furthermore, the elements in h n are modeled as independent and identically distributed (i.i.d.) zero-mean CSCG 
random variables with variance /3, i.e., 

h n ~ CN{Q,fH M ), n = 1, • • ■ ,N, (1) 

where (5 models the large-scale fading due to shadowing as well as the distance-dependent path loss. 

Within each block of T seconds, i.e., 0 < t < T, the input-output relation for the forward link energy transmission 
can be expressed as 

Vn{t) = hfx n (f) + Zn(t), n = 1, • • • ,N, (2) 

where y n (t ) denotes the received signal at the ER in the nth sub-band; x n (t) £ C Mxl denotes the baseband energy¬ 
bearing signals transmitted by the ET in sub-band n; and z n (t ) denotes the additive noise at the ER. Different 
from wireless communication where random signals need to be transmitted to convey information, x n (i) in Q 
is designated only for energy transmission and thus can be chosen to be deterministic. The average power p n 
transmitted over sub-band n can then be expressed as 

1 f T 

Pn = ^J ||x n (f)|| 2 di, n = 1, • • • ,N. (3) 

At the ER, the incident RF power captured by the antenna is converted to direct current (DC) power for battery 
replenishment by a device called rectifier [Q. We assume that the RF energy harvesting circuitry is able to achieve 
efficient energy harvesting over the entire operation frequency band. By ignoring the energy harvested from the 
background noise which is practically small, the total harvested energy at the ER over all N sub-bands during each 
block can be expressed as f3j 

N t 

Q = r lY^ i \ h n*n{t)\ 2 dt, (4) 

n= 1 J ° 

where 0 < rj < 1 denotes the energy conversion efficiency at the ER. Without loss of generality, x n (f) can be 
expressed as (see Fig. [T] for the transmitter schematics) 


x„(f) = s n g(t), 0 < t < T, n = 1, • • • , N, 


(5) 













6 


where s n G C Mx 1 denotes the energy beamforming vector for the nth sub-band, and g(t) represents the pulse¬ 
shaping waveform with normalized power, i.e., j \g(t)\ 2 dt = 1. Note that the bandwidth of g(t), which is 
approximately equal to 1/T, needs to be no larger than B s . We thus have 


1 

T 7 

- L C 


<-<B s <B 


Cl 


or T C B C 


> 1, i.e., a so-called “under-spread” wide-band fading channel is assumed [29]. 


B. Optimal WET with Perfect CSI 

With ([3]) and ([5]), the power p n transmitted over sub-band n can be expressed as p n = ||s n |j 2 , Vn. Furthermore, 
the harvested energy Q in Q can be written as Q = r/T Yln= i |h^s n | 2 . In the ideal case with perfect CSI, i.e., 
{h n }^ =1 , at the ET, the optimal design of {s n }^ =1 that maximizes Q can be obtained by solving the following 
problem 

N 

max rjT El h ? s »l 

n= 1 

subject to ||s n |[ 2 < Pti 

71=1 

||s n || 2 < P s , Vn, 

where P t denotes the maximum transmission power at the ET across all the N sub-bands, and P s corresponds 
to the power spectrum density constraint for each sub-band G3- Without loss of generality, we assume that 
P s <P t < NP S , since otherwise at least one of the constraint in <(6]) is redundant and hence can be removed. 
In addition, for the convenience of exposition, we assume that P t is a multiple of P s , i.e., Pt/P s = N 2 for some 
integer 1 < N 2 < N. It can then be obtained that the optimal solution to problem ([6]) is 

® [n] S 

[ 0 , n = N 2 + l,--- ,N, 

where [•] is a permutation such that ||h[ 1 ]|| 2 > ||h[ 2 ]|| 2 • • • > ||h[^v] II 2 - The resulting harvested energy is 

n 2 

Q'dea' = r?T P s ^ || h[n] ||2. (8) 

71=1 

It is observed from ([7]) that for a MISO multi-band WET system with P t < NP S , or N-> < N, the optimal scheme 
is to transmit over the N 2 strongest sub-bands only, each with the maximum allowable power. As a result, the 
remaining N — N 2 unused sub-bands can be opportunistically re-used for other applications such as information 
transmission. The solution in ([7]) also shows that for each of the N 2 selected sub-bands, maximum ratio transmission 
(MRT) should be performed across different transmit antennas to achieve the maximum energy beamforming gain. 
Moreover, Q implies that for multi-antenna WET systems in frequency-selective channels, both frequency-diversity 
and energy beamforming gains can be achieved to maximize the energy transfer efficiency in the ideal case with 
perfect CSI at the ET. 
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(a) First phase: Nj selected sub-bands with maximum frequency 
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(b) Second phase: the N 2 strongest sub-bands are further trained 


Fig. 2: Two-phase channel training for multi-antenna multi-band wireless energy transfer. 


C. Proposed Two-Phase Training Protocol 

In practice, the CSI needs to be obtained at the ET via channel training and/or feedback from the ER, which 
are subject to channel estimation and/or feedback errors and incur additional energy and time costs. Consequently, 
the maximum harvested energy given in ([8]) cannot be attained; instead, it only provides a performance upper 
bound for practical WET systems, for which efficient channel-learning schemes need to be designed. In this paper, 
by exploiting the channel reciprocity that the forward and reverse link channels are transpose of each other, we 
propose a two-phase channel training scheme to reap both the frequency-diversity and energy beamforming gains. 
As illustrated in Fig. [2] the first phase corresponds to the first n < T seconds of each block, where pilot signals are 
sent by the ER to the ET over N\ out of the N available sub-bands, each with energy E\ , where N 2 < N\ < N. 
By comparing the total received energy from all M antennas at the ET over each of the N) trained sub-bands, the 
ET determines the N 2 strongest sub-bands with largest antenna sum-power gains, and sends their corresponding 
orders and indices to the ER. In the second phase of 72 < T — t\ seconds, additional training signal is sent by the 
ER over the N 2 selected sub-bands, each with possibly different training energy depending on their relative order as 
determined in phase I. The ET then obtains an estimate of the MISO channel for each of the N 2 selected sub-bands, 
based on which MRT beamforming is performed for energy transmission during the remaining T — n — T 2 seconds. 
The proposed two-phase training scheme is elaborated in more details in the next section. 

III. Problem Formulation 

A. Two-Phase Training 

1) Phase-I Training: Denote by Aj c {1, • • • , N} the set of the N\ trained sub-bands in phase I, with N 2 < 
N\ < N. To maximize the frequency-diversity gain, the sub-bands with the maximum frequency separations are 
selected in Af\ so that their channels are most likely to be independent (see Fig. [2j a)), e.g., if N\ = 2, we have 
Af\ = {1, A}. The received training signals at the ET can be written as 

rjj(f) = y/E[\i* n ^ n {t) + w l n (t), 0<t<Ti, n 6 A (9) 

where E\ denotes the training energy used by the ER for each trained sub-band; represents the training 

waveform for sub-band n with normalized energy, i.e., J'J 1 \cj) n (t)\ 2 dt = 1, Vn; and w',(/) £ C Mx 1 represents the 























Fig. 3: G n (N\, M ), n = 1,2,3, versus N\ for M = 2 and M = 5. 


additive white Gaussian noise (AWGN) at the receiver of the ET with power spectrum density 
consumed at the ER for channel training in this phase is 

K= J2 f 1 y/EiMt ) 2 = E iNi. 

neA/i J ° 




( 10 ) 


At the ET, the received training signal is first separated over different sub-bands. After complex-conjugate 
operation, each r{* (t) then passes through a matched filter to obtain 

yh= [ rl n (t)(j>n(t)dt = \/Eih n + z\, u£jVi, (11) 

J 0 

where z l n ~ CA/"(0, NqIm) denotes the i.i.d. AWGN vector. Based on < [TT| ), the ET determines the out of the 
N\ trained sub-bands that have the largest power gains. Specifically, the N± sub-bands in A/j are ordered based on 
the total received energy across all the M antennas such that 

llyjr] II 2 > 11 y [ 2] II 2 > • • • > llyfjvi] II 2 - ( 12 ) 

The ET then sends to the ER the ordered indices of the N 2 strongest sub-bands A /2 = {[1. ■ • ■ , [A^]}. 

2) Phase-11 Training: In the second phase of T 2 seconds, additional pilot signals are transmitted by the ER over 
the N 2 selected sub-bands for the ET to estimate the corresponding MISO channels. Denote by the training 
energy used for the ?rth strongest sub-band as determined in ( fT2| ), n = 1, • • ■ , With similar processing as in 
phase I, the resulting signal at the ET during phase II over sub-band [n] can be expressed as 

yjj] = s/E^\\ [n] + zj^, n = 1, ■ ■ ■ , JV 2 , (13) 


where zj^ ~ CAf(0, NqIm) denotes the AWGN. The ET then performs the linear minimum mean-square error 
(LMMSE) estimation for h[ n ] based on y Q To obtain the optimal LMMSE estimator, we first provide the 
following lemma. 

1 In principle, h[„j can be estimated based on both observations yj n j and . To simplify the processing of multi-band energy detection 
in phase-I training, we assume that yj„j is only used for determining the strongest sub-bands via estimating the power ||h[ u ]|| 2 while only 
y[^j is used for estimating the exact MISO channel hj n . 
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Fig. 4: G n (Ni, M ), n = 1,2, 3, versus M for JVi = 10 and N\ = 100. 


Lemma 1: Given that h„ and h m are independent Vn, m E A/i and n / m, the average power of h[ n ], where 
[n] denotes the nth strongest sub-band determined via phase-I training as in (fT2]), can be expressed as 


RniNuEj^ E 


hr, 


P 2 E 1 G n (N 1 ,M) + pN 0 M 
/3Ei + N 0 

,N 2 , 


n= l,--- , N 2 , (14) 

with G n (Ni,M) an increasing function of N\ and M as defined in (56 1 and (571 in Appendix [b] which satisfies: 

Gi{N u M) > G 2 {N 1 ,M) >■■■> G N2 (N u M) > M. (15) 

Proof: The proof of Lemma |T] requires some basic knowledge of order statistic [281, which is briefly introduced 
in Appendix [A] The detailed proof is given in Appendix [Cj following a useful lemma presented in Appendix |Bj ■ 
It can be inferred from Lemma [I] that R n {N\. E\) monotonically increases with the training-energy-to-noise ratio 


E\/No- In the extreme case when E\ /Nq —» 0, for which the energy comparison in ( f]~2| ) is solely determined by 
the additive noise and hence is unreliable, R n (Ni, Ei) in ( fT4) ) reduces to j3M, Vn, consistent with the assumed 
channel statistics in ([!]). In this case, no frequency-diversity gain is achieved. On the other hand, as Ei/Nq oo, 
R n (Ni, E\) approaches to its upper bound jjG n {N] ■ M). Thus, the ratio G n {N \, M)/M can be interpreted as 
the maximum frequency-diversity gain achievable for the nth strongest sub-channel by training Ni independent 
sub-bands. As an illustration, Fig. [5] and Fig. [4] plot the numerical values of G n (Ni, M) for n = 1, 2, 3 against iVi 
and M, respectively. It is observed that while G n {Ni, M) increases linearly with M, it scales with Ni only in a 
logarithmic manner. 

With Lemma [TJ the LMMSE estimator of the MISO channels corresponding to the N 2 selected sub-bands in 
phase-II training can be obtained as follows. 

Lemma 2: The LMMSE estimator h[ n ] for the MISO channel h[ n ] based on ( |T3| ) is 

\jE 2 ^ n R n (Ni , Ei ) n _ 

E 2: nRn(Ni,Ei) + N 0 M y ^ n ~ 2 - 


hfnl 


(16) 
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Define the channel estimation error as hr„i = hr„i — hr„i. We then have 


E 

E 

E 


^[n]^[n] 


NpMRnjNuE i) 
E^nRntN^E^ + NvM' 
= E 2 , n R 2 n (N 1 ,E 1 ) 

E 2>n R n (N 1 ,E 1 ) + N Q M' 

= 0, n = 1, • • • ,N 2 . 


(17) 

(18) 
(19) 


Proof: Please refer to Appendix [D] 


B. Net Harvested Energy Maximization 

After the two-phase training, energy beamforming is performed by the ET over the N 2 selected sub-bands based 
on the estimated MISO channels {h[ n ]}n=i for WET during the remaining time of T — n — r 2 seconds. According to 
([7]), the energy beamforming vector for the nth selected sub-band is set as s „| = v / ^h[ n ]/||h[ n ]||, n = 1, • • • , N 2 . 
The resulting energy harvested at the ER can be expressed a.^ 


* k,h w 

q = ,,tp,J2 ' 


n=l 

N 2 


hfnlll 2 


VTP S J2 l!h W H 2 + 




n=l 


l h wll 2 


+ h^ihr, 


+ hi^ihr. 


( 20 ) 


where we have used the identity h[ n ] = h„j + h- n | in (201. Based on (fT7|)-(fT9l), the average harvested energy at the 
ER can be expressed as 

Q(N 1 ,E 1 ,{E 2 , n }) = E\Q 

(M - 1)N 0 


n 2 


riTP s S }TR n (N l .E l )(\ 


72—1 


E 2) nRn(Ni, El) + N 0 M 


( 21 ) 


It is observed from ( |2T| ) that for each of the N 2 selected sub-bands, the average harvested energy is given by a 
difference of two terms. The first term, qTP s R. n {N\. E\), is the average harvested energy when energy beamforming 
is based on the perfect knowledge of hj n j over sub-band [n], where [n] is the nth strongest sub-band determined via 
phase-I training among the N\ trained sub-bands each with training energy E\. The second term can be interpreted 
as the loss in energy beamforming performance due to the error in the estimated MISO channel h[ n j in phase-II 
training. For the extreme scenario with E 2)U /Nq —> oo so that h„j is perfectly estimated, or in SISO system 
(M = 1) for which no energy beamforming can be applied, the second term in ( |2T| ) vanishes, as expected. 

The net harvested energy at the ER, which is the average harvested energy offset by that used for sending pilot 
signals in the two-phase training, can be expressed as 


n 2 


Qnet(iVl,£l,{£2,n}) = Q(iVi, E 1 , {E 2 , n }) - E U \, - ^ E 2 , 


( 22 ) 


72 — 1 

2 For notational simplicity, we assume that T is sufficiently large so that T n + as a result, the time overhead for channel training 
is ignored (but energy cost of channel training remains).If the above assumption does not hold, the results derived in this paper are still valid 
by simply replacing T with T — n — T 2 with any given n and T 2 . 
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The problem of finding the optimal training design to maximize Q net can be formulated as 

(PI): max Qnet(Ni, E ly {E 2 n }) 

N 1 ,E 1 ,{E 2tTl } 

subject to N 2 < Ni < N, 

Ei > 0, E 2) n >0, n = 1, ■ ■ ■ , N 2 . 

Note that in (PI), we have assumed that there is sufficiently large initial energy stored at the ER in the first block 
for sending training signals. Under this assumption and considering the fact that our optimized net energy Qnet 
in (PI) is strictly positive and generally much larger than that required for training (for implementing other more 
energy-consuming functions at the ER such as sensing and data transmission), we further assume that there is 
always enough energy available to send the training signals at the beginning of each subsequent block. 

In the following sections, we first develop an efficient algorithm to obtain the optimal solution to (PI), and then 
investigate the problem in the asymptotic regime with large M or large N to gain useful insights. 


IV. Optimal Training Design 

To find the optimal solution to (PI), we first obtain the optimal training energy {E 2>n }^l. 1 in phase II with Ni 
and Ei fixed. In this case, (PI) can be decoupled into N 2 parallel sub-problems. By discarding constant terms, the 
nth sub-problem can be formulated as 

(M - l)Noi]TP s R n (Ni, Ei) 


mm 


E 2 , n > o E 2 , n Rn(Ni,E 1 ) + N 0 M 


+ E 2r 


(23) 


Problem (231 is convex, whose optimal solution can be expressed in closed-form as 


E l n (Ni,Ei) = 


\Jr]TP s (M -1)N 0 - 


N 0 M 


Rn{Nl,Ei) 


+ 


n = !,-■■ , N 2 . 


(24) 


The solution in ((24]) implies that non-zero training energy should be allocated for the nth strongest sub-band in phase 
II only if its expected channel power R n (N\. E\) is sufficiently large. Furthermore, ( [24] ) has a nice water-filling 
interpretation, with fixed water level y/rjTP s {M — IjiVo and different base levels that depend on the expected 
channel power gain R n (Ni, E \). For sub-bands with huger R, n (N\, E\), more training energy should be allocated 
to achieve better MISO channel estimation, as expected. 


The corresponding optimal value of (231 is 


l(N 1 ,E 1 ) = 


I 


M—l 

M 


r]TP s R n {Ni,Ei) 


if R n (N 1 , Ei) < a, 


(25) 


\ 2 V( M - l)NovTP s - rJn^E!) > otherwise, 
where a = i/NqM / yjrfTP S (M — 1). By substituting the above optimal value of ([23]) into ( [22] ), the objective 


function of (PI) in terms of N\ and E\ only can be expressed as 


n 2 

Q ne t{Ni,Ei) = [j 1 TP,R n (Ni,E 1 ) - v*(Ni,Ei) 


n= 1 


- NiEi. 


(26) 
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case 2-.a<pM\ ' " \ j3G j+l <a< pG j \ 

PM PG jn PC,, 


case l:ar > fiG 1 

PG\ 


Fig. 5: Three different cases for solving problem (|28|). 


As a result, (PI) reduces to 


max Qnet(Ni,Ei) 

E± ,iVi 


(27) 


subject to N 2 < Ni < N, E\ > 0. 

To find the optimal solution to problem ( [27] ), we first obtain the optimal training energy E\ with N\ fixed by 
solving the following problem, 

max QnetC/Vi,#!). (28) 

Ei >0 

Denote by (Aj ) and Q* el (Aj) the optimal solution and optimal value of problem ( [28] ), respectively. The original 
problem (PI) then reduces to determining the optimal number of training sub-bands N\ as 


N* = arg max Q*et(JVi), 

1 V 2 ^ /V 1 \ 1 V 


(29) 


which can be easily solved by one-dimensional search. Therefore, the remaining task for solving (PI) is to find 
the optimal solution to problem ([28]). To achieve this end, it is first noted from ( fl4| ) that for any fixed A r i, as the 
training energy E\ varies from 0 to 00 , R, n {N \, E\) monotonically increases from @M to /3G n (N\. M ), i.e., 


PM < RniNuEi) < pG n (N u M), \/E 1 > 0, n = 1, ■ ■ ■ ,N 2 . 


Further define 


r A vTPsP 2 


N 0 


(30) 


(31) 


as the two-way effective signal-to-noise ratio (ESNR), where the term !3 2 captures the effect of two-way signal 
attenuation due to both the reverse-link training and forward-link energy transmission. Based on the property of 
G n (N \. M) given in ( fl5| ), problem ([28]) can then be solved by separately considering the following three cases 
according to the intervals in which a lies, as illustrated in Fig. [5] 

Case 1: a > dG \ (Aj. M), or F < > which corresponds to the low-ESNR regime or the so-called 

harsh environment for WET by taking into account all the relevant factors including the channel conditions, the 
power limits at the ET, and the achievable energy-beamforming/frequency-diversity gains. In this case, it follows 


from (30) that Il n (N\, E\) < a and E3 2 n {N\ .E\) = 0, Vn = 1, • • • , N 2 and E\ > 0. In other words, phase-II 
training should not be performed due to the limited energy beamforming gains. Therefore, Onett^i, £ 1 ) in 
reduces to 


n 2 


QnetiNx^i) = T]TP s y 


n=l 


Rn(N\, Ei) 

M 


— E\N\, ME\ > 0. 


(32) 
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By substituting R n (N \, E\) with ( fi~4| ). problem ( |28j ) reduces to 


No 


™ aX ^PE 1 G n (N 1 ,M)/M + N 0 r Ar 

max riTP s p > -—-—- E\Ni, 

E 1 >o orn 1 Ar 

n =1 


PE i + iV 0 

which is a convex optimization problem with the optimal solution given by 


(33) 


E^N,) = V^TPsNo 


N 


N 2 fG n (N lt M) 


E -lV 2 
71—1 


M 

ivT 


- i 


The corresponding optimal value of (J33J) can be expressed as 

i]TP s pN 2 , 

QUN i) = ' 


lTP.fi ( N 2 + (Ve*. ( 


G n (N 1 ,M) 

M 


-iWf 


1 


if Enil 


otherwise. 


(34) 


G n (Ni,M) 

M 


- 1 ) < ^ 


(35) 


It is observed from ( |34| ) that for any fixed N\, non-zero training energy should be allocated in phase I only if 
the frequency-diversity gain normalized by iVi is sufficiently large. In this case, the net harvested energy is given 
by a summation of that achieved in the case without CSI at the ET, rjTP s /3N 2 , and an additional gain due to the 
exploitation of frequency diversity, as shown in ([35]). 

Case 2: a < (3M, or T > 1/(M — 1), which corresponds to the high-ESNR regime or a favorable WET 
environment. In this case, it follows from ( |3Q| ) that R n (A T \ , E\ ) > a and E^ n (Nj. E\ ) > 0, Vn = 1, • • ■ , N 2 
and Ei > 0. In other words, regardless of the training energy E\ in phase I, non-zero training energy should be 
allocated to each of the N 2 selected sub-bands in phase II due to the potentially large beamforming gains. As a 

can be expressed as 


result, it follows from (25 1 that Q net (Ni, E±) in 

n 2 

Qnet(iVi, E x ) = ]T [r,TP s R n (Ni, E x ) - 2 y/(M - l)N oV TP s 
N 0 M 


72—1 

+ 


- N\E\. 


Rn(Ni, El). 

By substituting R n (Ni, Ei) with (|T4[) and after discarding constant terms, problem 


n 2 

max riTP s B 
Ei>0 ^ 

72—1 


PEiG n (Ni,M) + N 0 M 


+ 


NqM 


n 2 

E 


PEi + N 0 
PEi + N 0 


(36) 

can be reformulated as 

(37) 


-EiNi. 


P ^[pEiG n (Ni,M) + N 0 M 

Problem ( [37] ) belongs to the class of optimizing the sum of linear fractional functions, which is difficult to be 
optimally solved in general. However, with some manipulations, problem ([37]) can be simplified into 

bo „ b r , 


min /(EG = 

Ei>0 Ei + Co 


+ EiNi - 


72—1 


Ei + c n 


(38) 
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where 


TV, 


b 0 = vTPsNq Y (G n (N i, M) -M), co 


n=l 


No 
P ’ 


bn — 


NqM (i - 


M 


G„(TVi,M) 


(39) 


P 2 G n (N\, M) 


C-n — 


NqM 


PG n (N u M) 


, Vn. 


Problem ( [381 ) is still non-convex due to the non-convex objective function. However, by examining its Karush- 
Kuhn-Tucker (KKT) necessary conditions for global optimality (not necessarily sufficient due to non-convexity in 
general), the optimal solution to problem ( |38j ) must belong to the set 

df(E i) 


Si = {0} U \E 1 > 0 


dEi 


= 0 


By taking the derivative of the function f(E\) in ( [38] ) and after some algebraic manipulations, it can be shown 
that finding the static 
polynomial equation, 


that finding the stationary points satisfying = 0 is equivalent to determining the roots of the following 


n 2 n 2 

n^ei +co ) 2 n(*+ a ) 2 - b o n ^ 1 + c *) 2 

i —1 i =1 

N 2 n 2 

+ {Ei + co) 2 'Yj bn XT (■^'t 3“ c *) 2 = 

n= 1 

which has order 2 N 2 + 2. The cardinality of £\ is thus bounded by \£\ < 2N-2 + 3. Therefore, the optimal solution 
to problem ( |38] ) can be readily obtained by comparing at most 2A 7 2 + 3 candidate solutions as 

(40) 


E*(Ni) = arg min f(E 1 ). 

E'iGci 

With E* (A r i) determined, the corresponding optimal value of problem 
by direct substitution based on 


for the case a < (3M can be obtained 


Case 3: (iG ]+l (Aq, M) < a < BG 3 (N { ,M), 


or 


M 2 


< r < 


M 2 


r, for some j = 


(M—l)G 2 (N 1 ,M) - x - (M-l)G 2 +1 (TVi,M) 

1, • • • , N 2 . This corresponds to the medium-ESNR regime or a moderate WET environment. For convenience, we 

define Gjv 2 +i(-^i> M) = / 3M. Based on ( fT5| ) and ( [30| ), we have R n (N\ .E\) < a, Vn = j + 1, • • • , N 2 and E\ > 0. 
On the other hand, for n = 1, • • • , j, R n (N\ . E\ ) may be larger or smaller than a, depending on the training energy 
Ei. Let E\ be chosen such that Rj.+i(Ni, Ei) < a < Rk{N\, E\) for some k = 0, • • • ,j. Then based on (JT4[), 
we must have E\ G E^\ E[ k+1 ^ , where E^ = • ^ or convenience, we define e[' +1) = qq anc j 

= 0. For E\ belonging to the above interval, we have R n (N\. E\) > a, Vn = 1, • • ■ ,k, and R n (N ], E\) < a, 
Vn = k + 1, ■ ■ • ,j. Therefore, depending on the j + 1 intervals in which E\ lies, Q n ct(A r i, E\) in ( [26] ) can be 
explicitly written as 

Q*x{Ni,Ei) = f k {Ei), for Ei G [E [ k] >, E ^^ 1} ], k = 0, • • ■ , j, (41) 















where 
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fk(Ei) 4 ]T (pTP'RniNuEi) - 2\J(M — l)N 0 r]TP s + R J°™ Ei) ) 

N " =1 " ^ 1 (42) 

/ M — 1 \ 

+ Y lvTP,R n {N 1 ,E 1 )-^^rfrP a R n (N 1 ,E 1 )\-N 1 E 1 . 

n=k +1 ' ' 

As a result, problem ( [28] ) for Case 3 can be decomposed into j + 1 parallel sub-problems by optimizing Ej over 
each of the j + 1 intervals, with the /rth sub-problem formulated as 


max fk(Ei) 

Ei 

subject to 1 < I?i < 

Denote by the optimal solution to the A th sub-problem ( |43j ). The solution E* to problem 
by 


(43) 

is thus given 


E\(Ni) = arg max M E *,k), 
E ik> k=0 >- J 


(44) 


which can be readily determined. Thus, the remaining task is to solve problem (|43|). With some simple manipulations 


and after discarding irrelevant terms, ( |43| ) can be reformulated as 

do 


mm 


E[ k) <E 1 <E[ k+1) + Cq 


+ E l N l - Y 


77.—1 


E\ + c n 


(45) 


where cq, b n , and c n , for n = 1, • • ■ , k, are defined in (391, and 


k N 2 

do±vTP s N 0 (Y( G n(Ni,M)-M)+ Y ( 


G n (Ni,M) 

M 



n—l n—k -\-1 

Note that problem ( [45] ) has identical structure as problem ( [38] ) and thus can be optimally solved with the same 
technique as discussed above. 

The algorithm for optimally solving the training optimization problem (PI) is summarized as follows. 
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Algorithm 1 Optimal Solution to (PI) 


1: for Ni = N 2 , ■ ■ ■ , N do 
2 : if a > (5G\{N\,M) then 

3: Obtain E*(N\) and Q* c t(^ r i) based on < [34| ) and ( [35] ), respectively. 

4: else if a < j3M then 


5: Obtain E*(N\) and the corresponding Q^ a (N\) based on (40 1 and ( [36] ), respectively. 

6 : else 

7: Determine j £ {1, • • • , N 2 } such that fiG J+ \ (N\. M) < a < /3Gj(N\,M) 

8: for k = 0, • • ■ , j do 

9: Find E* k by solving ( |45[ ) and obtain the corresponding f k (E* k ) based on ( |42[ ). 

10 : end for 

11: Obtain E*(N\) and the corresponding QJ et (Ai) based on (44 1 and ( |4T| ), respectively. 

12 : end if 

13: end for 


14: Obtain N k and the corresponding E* based on ([29 1 . 


15: Obtain E^ n , n = 1, ■ ■ • , N 2 , based on ( |24| ) with iVf and E k . 


Algorithm [I] can be efficiently implemented since it has a polynomial complexity, as we show next. First, it is 
noted that for each iteration of the outer “for” loop, the worst-case scenario corresponds to the execution of the 
third case (lines 7-10). The interval searching in line 7 can be easily implemented with the bisection method of 
complexity Oflog A^), where O(-) is the standard “big O” notation. Line 9 involves solving polynomial equations 
of order 2A r 2 + 2, for which there exist various efficient algorithms of complexity no larger than 0((2A r 2 + 2) 3 ) [30]. 
Furthermore, since in the worst-case scenario, the outer (line 1) and inner (line 8) “for” loops will be executed 
by N — N '2 + 1 and A^ + 1 times, respectively, the overall complexity of Algorithm [I] can be expressed as 
0((N -N 2 + l)[log A 2 + (JV 2 + 1)(2JV 2 + 2) 3 ]), or 0((N - N 2 )N$) for large N and N 2 . 

V. Asymptotic Analysis 

In this section, we provide an asymptotic analysis for the training optimization problem (PI) with either a large 
number of ET antennas M (massive-array WET) or a large number of sub-bands N (massive-band WET), from 
which we draw important insights. 

A. Massive-Array WET 

Massive MIMO has emerged as one of the key enabling technologies for 5G wireless communication systems due 
to its tremendous improvement on energy and spectrum efficiency by deploying a very large number of antennas 


(say, hundreds or even more) at the base stations [ 131, [14|. For WET systems with a large number of antennas 
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deployed at the ET, it is expected that the energy transfer efficiency can be similarly enhanced, provided that the 
channel is properly learned at the ET to achieve the enormous beamforming gain. To obtain the asymptotic analysis 
for (PI) with M 1, we first present the following lemma. 

Lemma 3: For the function G n (Ni, M) defined in Lemma [9] in Appendix |Bj the following asymptotic result 
holds: 


G n (Ni,M)—tM, Vn, N 1: as M -> oo. (46) 

Proof: For the N\ i.i.d. CSCG random vectors v, ~ CAf(0, cr 2 Ijvf) given in Lemma[9j the following asymptotic 
result holds as M — y oo due to the law of large numbers: 

1 „ 


M 


V,; 


a v , V* = 1, ■ 


,N V 


(47) 

(48) 


Therefore, we have E[||v[ n ]|| 2 ] — > cr%M, Vn, which thus leads to ( |46| ) by definition. ■ 

Lemma 4: For MISO WET systems in frequency-selective channels with M transmit antennas and N independent 
sub-bands, as M —> oo, the optimized two-phase training scheme reduces to 

ge ‘ M -> 0, 

-> VvTP s N 0 M, n = 1, • ■ • , N 2 . 

Furthermore, the resulting net harvested energy scales linearly with M as 

Q':T M ->• r,TN 2 P sl 3M. (49) 

Proof: Please refer to Appendix [E] ■ 

Lemma |4] shows that due to the so-called “channel hardening” effect for large M, i.e., ||h n || 2 — > /3M, Vn = 
1, • • • , N, as M —> 00 , all the N sub-bands have essentially identical power gain f3M. Therefore, the frequency- 
diversity gain vanishes and hence phase-I training is no longer required, as shown in ( |47| ). Moreover, it is observed 
from ( [48) ) that as M increases, the training energy in phase II for each of the selected sub-band scales with M in 
a square root manner. Since the beamforming gain increases linearly with M, which dominates the training energy 


cost at large M, the net harvested energy scales linearly with M, as shown in ( |49| ). 

Lemma 5: For MISO WET systems in frequency-selective channels with M transmit antennas and N independent 
sub-bands, as M —> 00 , the average harvested energy in the ideal case with perfect CSI at the ET scales with M 
as 


Q 


ideal 


rtTNzPspM. 


(50) 
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Proof: With perfect CSI at the ET, the maximum harvested energy for each channel realization has been 

obtained in ([8]). The average value taken over all channel realizations can then be obtained as 

n 2 

0* deal = IE [Q ldeal ] = vTPs E [||h [n] || 2 ] 

n=1 


N 2 

= r)TP s Y J PG n (N,M) 

71 = 1 


rjTN 2 P s pM, 


(51) 

(52) 


where ( |5T| ) follows from the definition of the function G n (N \. M) given in Lemma [9] in Appendix |b| and (52) is 
true due to Lemma [3] ■ 

It immediately follows from Lemma[4]and Lemma[5]that for MISO WET systems in frequency-selective channels 
with M transmit antennas and N independent sub-bands, the proposed two-phase training scheme achieves the 
optimal asymptotic scaling with M. 


B. Massive-Band WET 

In this subsection, we investigate the WET system with an asymptotically large bandwidth B, or equivalently, 
with an unlimited number of sub-bands N. We first study the asymptotic scaling of the average harvested energy 
at the ER with N under the ideal case with perfect CSI at the ET. 

Lemma 6: For MISO WET systems in frequency-selective channels with M transmit antennas and N independent 
sub-bands, as N -» oo, the average harvested energy Q ldeal (jV) in the ideal case with perfect CSI at the ET scales 
logarithmally with N, i.e., Q ldeal (_/V) = 0(lniV). 

Proof: Please refer to Appendix [FJ ■ 

Lemma 7: For MISO WET systems in frequency-selective channels with M transmit antennas and N independent 
sub-bands, as N —> oo, the net harvested energy Q largl>;V with the proposed two-phase training scheme is upper- 
bounded by 

0^-" < „riWM( i + - ^ / = m ]) ■ (53) 

where T is the two-way ESNR defined in ( |3T| ), and W(z) is the Lambert W function defined by z = W(z)e w ( z \ 
Proof: Please refer to Appendix [G] ■ 

Lemma [7] shows that, different from the asymptotic linear scaling with the number of transmit antennas M as in 
Lemma[4j the net harvested energy at the ER does not asymptotically increase with N as N becomes large; instead, it 
is upper-bounded by a constant value that depends on the two-way ESNR, T. This result is unsurprising, since while 
the energy cost (E\N{) for channel training in phase I increases linearly with the number of trained sub-bands, the 
frequency-diversity gain scales only in a logarithmic manner with N\, as shown in Lemma [6j Consequently, beyond 
some large value of N, the benefit of channel training for exploiting the frequency-diversity gain cannot compensate 
the training energy cost; thus, further bandwidth expansion does not provide any performance improvement in terms 
of the net harvested energy at the ER. 
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Fig. 6 : Net average harvested power versus the number of trained sub-bands N± for M = 10 and M = 20. 


VI. Numerical Results 


In this section, numerical examples are provided to corroborate our study. To model the frequency-selective 


channel, we assume a multi-path power delay profile with exponential distribution A(t) = 


— I p /^rms 


, T > 0, 


where <r rms denotes the root-mean-square (rms) delay spread. We set cr rms = 1/is so that the 50% channel coherence 


bandwidth [31], i.e., the frequency separation for which the amplitude correlation is 0.5, is B c = 


2lUJ n 


160 


kHz. We assume that the system is operated over the 902-928MHz ISM band, which is equally divided into 
N = 866 sub-bands each with bandwidth B s = 30kHz. To comply with the FCC (Federal Communications 
Commission) regulations over this band [17], (Part 15.247, paragraph (e): the power spectral density from the 
intentional radiator “shall not be greater than 8 dBm in any 3kHz band”), we set the transmission power for each 
sub-band as P s = 60mW. Furthermore, according to the FCC regulations Part 15.247, paragraph (b) (3), the 
maximum output power over the 902-928MHz band shall not exceed lwatt, i.e., P t < lwatt. Therefore, at each 
instance, the maximum number of sub-bands that can be activated for energy transmission is IV 2 = [Pt/P s \ = 16. 
We assume that the power spectrum density of the training noise received at the ET is Nq = — 160dBm/Hz. The 
average power attenuation between the ET and the ER is assumed to be 60dB, i.e., 0 = 10“ 6 , which corresponds 
to an operating distance about 30 meters for carrier frequency around 900MHz. Furthermore, we assume that the 
energy conversion efficiency at the ER is 77 = 0.8. 

In Fig. [ 6 j by varying the number of sub-bands N\ that are trained in phase I, the net average harvested power 
achieved by the optimized two-phase training scheme is plotted for M = 10 and M = 20, where the average is 
taken over 10 4 random channel realizations. The channel block length is set as T = 0.05ms. The analytical result 
obtained in Section IV i.e., Q* et (Ni)/T with Q^ et (Ni) denoting the optimal value of problem ([28]), is also shown in 


Fig. [ 6 ] It is observed that the simulation and analytical results match quite well with each other, which thus validates 
our theoretical studies. Furthermore, Fig. [ 6 ] shows that there is an optimal number of sub-bands to be trained for 
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Fig. 7: Optimal training energy E\ and E->.\ versus block length T for M = 10 and M = 20. 


maximizing the net harvested energy, due to the non-trivial trade-off between achieving more frequency-diversity 
gain (with lager N\) and reducing the training energy (E\ A'i in phase I). 

In Fig. [7J the optimal training energy per sub-band E\ in phase I and that for the strongest sub-band £ 2,1 in 
phase II are plotted against the channel block length T, with T ranging from 0.1 to 100ms. It is observed that 
E\ and £ 0,1 both increase with T, which is expected since as the block length increases, more training energy 
is affordable due to the increased energy harvested during each block. It is also observed from Fig. [7] that more 
training energy should be allocated for system with M = 20 than that with M = 10, since energy beamforming is 
more effective and hence training is more beneficial when more antennas are available at the ET. Furthermore, for 
both cases of M = 10 and M = 20, £ 2,1 is significantly larger than £ 1 , since in phase II, only the IV 2 selected 
sub-bands need to be further trained, whereas the training energy in phase I needs to be distributed over N\ E- IV 2 
sub-bands to exploit the frequency diversity. 

In Fig. [8] the net average harvested power based on the proposed two-phase training scheme is plotted against 
block length T, with T ranging from 0.1 to 20ms. The number of antennas at the ET is M = 20. The following 
benchmark schemes are also considered for comparison. 

1) Perfect CSI: In this case, ideal energy beamforming as given in ([7]) can be performed and the average harvested 
energy is given in <HD- This corresponds to the performance upper bound since the frequency-diversity and 
energy-beamforming gains are maximally attained without any energy cost for CSI acquisition. 

2) No CSI: In this case, N 2 out of the N total available sub-bands arc randomly selected and isotropic energy 
transmission is performed over each of the selected sub-band. The resulting average harvested energy at the 
ER can be expressed as Qnocsi = p27P s /3JV 2 . Neither frequency-diversity nor beamforming gain is achieved 
in this case. 

3) Phase-1 training only: This corresponds to the special case of the proposed two-phase training scheme with 
£ 2 ,™ = 0, Vn. Thus, only frequency-diversity gain is exploited. 
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Fig. 8: Net average harvested power versus block length T for M = 20. 

4) Phase-11 training only: This corresponds to the special case of the two-phase training scheme with E\ = 0. 
Thus, only energy beamforming gain is exploited. 

5) Brute-force training: This refers to the single-phase training scheme where the exact MISO channels at all the 
N available sub-bands are estimated. Both frequency-diversity and energy-beamforming gains are exploited 
by this scheme, but it generally requires the most training energy among all the training schemes. 

It is observed from Fig. [8] that the proposed two-phase training scheme significantly outperforms the other four 
benchmark schemes. As the block length T increases, both the two-phase training and the brute-force training 
schemes approach the performance upper bound with perfect CSI, which is expected since the energy cost due to 
channel training becomes less significant with increasing T. It can also be inferred from the figure that for multi¬ 
antenna WET in frequency-selective channels, exploiting either frequency-diversity (with phase-I training only) or 
energy beamforming (with phase-II training only) gain alone is far from optimal in general; instead, a good balance 
between these two gains as achieved in the proposed two-phase training optimization is needed. 

In Fig. [9} to investigate the massive-array case with large M, the net average harvested power is plotted against the 
number of antennas M at the ET for T = 1ms. It is observed that both the schemes of no-CSI and phase-I training 
only result in poor performance. On the other hand, for the two training schemes that exploit energy beamforming 
gain (two-phase training and phase-II training only), the net average harvested power scales linearly with M, which 
is in accordance to Lemma |4j It is also observed from Fig. [9] that, under this setup, the optimized two-phase training 
still provides considerable performance gain over the phase-II training only, although the performance gap becomes 
inconsequential for extremely large M due to the linear scaling of both schemes, as expected from Lemma [4] 

In Fig. [I0| to investigate the massive-band case with large N, the net average harvested power is plotted against 
the total number of available sub-bands N for the case of SISO WET with A4 = 1 and M = 1 (thus, phase-II 
training is no longer needed and the proposed two-phase training is the same as phase-I training only), where 
N varies from 20 to 2000 (by assuming that some other spectrum in addition to the 902-928MFlz ISM band is 
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Fig. 9: Net average harvested power versus number of antennas M at the ET. 



Total number of sub-bands N 


Fig. 10: Net average harvested power versus total number of sub-bands N for SISO WET. 


available). The channel block length is set as T = 50ms. It is observed that in the ideal case with perfect CSI 
at the ET, the average harvested power increases logarithmically with N, which is in accordance to Lemma [6] 
In contrast, for the proposed training scheme that takes into account the energy cost of channel training, the net 
harvested power saturates as N becomes large, as expected from Lemma [7] Furthermore, Fig. 10 shows that in this 
case, the asymptotic result given in ( [53] ) provides a satisfactory performance upper bound for the proposed training 
scheme with large N. 


VII. Conclusions and Future Work 

This paper studies the energy-constrained training design for MISO WET systems in frequency-selective channels. 
By exploiting channel reciprocity, a two-phase training scheme is proposed. A closed-form expression is derived 
for the net harvested energy at the ER, based on which a training design optimization problem is formulated and 
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optimally solved. Asymptotic analysis is also presented for systems with a large number of transmit antennas or 
independent frequency sub-bands to gain useful insights. Numerical results are provided to validate our analysis and 
demonstrate the effectiveness of the proposed scheme which optimally balances the achievable frequency-diversity 
and energy-beamforming gains in multi-antenna WET with limited-energy training. 

There are a number of research directions along which the developed results in this paper can be further 
investigated, as briefly discussed in the following. 

• MIMO WET: The optimal training design for the general MIMO wide-band WET systems remains an open 


problem. MIMO WET for flat-fading channels have been studied in [|26|. However, the extension to the more 
complex frequency-selective channels is no-trivial and deserves further investigation. 

Correlated Channel: The analysis in this paper is based on the assumption of independent channels in both 
spatial and frequency domains, which practically holds with sufficient sub-band partition and antenna spacing. 
For the general setup with correlated channels, both the training optimization and performance analysis become 
more challenging; however, spatial and/or frequency channel correlations can also be exploited to reduce the 
training overhead and may enhance the overall energy transfer efficiency, which needs further investigation. 
Multi-User Setup: More research endeavor is needed to find the optimal training design for the general multi¬ 
user setups with near-far located ERs. In this case, the so-called “double near-far” problem reported in | 6 j, 
[26] needs to be addressed, where a far-away ER from the ET suffers from higher propagation loss than a 
close ER for both reverse-link channel training and forward-link energy transmission. 

Energy Outage: In this paper, the net average harvested power at the ER is maximized by optimizing the training 
design. In certain scenarios such as the delay-sensitive applications, a more appropriate design criterion may 
be minimizing the energy outage probability of ERs, for which the optimal training and energy transmission 
design still remain open. 


Appendix A 
Order Statistic 


This appendix gives a brief introduction on order statistic, which is used for the proof of Lemma [T] A compre¬ 


hensive treatment on this topic can be found in e.g. [28]. 

Definition 1: Let V\, - ■ ■ , V/v, be N\ real-valued random variables. The order statistics Vjq, • • • , Try, - are random 
valuables defined by sorting the values of Vi, ■ ■ ■ , Vy, in non-increasing order, i.e., Vm > Vr 2 i > ■ ■ ■ > kWi. 

Lemma 8: Let V\, ■ ■ ■ , Vy 1 be N\ i.i.d. real-valued random variables with cumulative distribution function (CDF) 
F(v). The CDF of the nth order statistic Vj n j, denoted as F- r ^(v), can be expressed as 

F [ri] (V) = J2 (^) F(v) Nl ~ k (1 - F(v)) k ,n = l,---,N 1 . 


(54) 
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Appendix B 
A Useful Lemma 


Lemma 9: Let vi, • • • , vjv x £ C Mxl be N\ i.i.d. zero-mean CSCG random vectors distributed as v n ~ CAf(0, <r 2 I m), 


Vn. Further denote by [•] the permutation such that ||v[ 1 j|j 2 > 11V [ 2 ]|[ 2 > ■ ■ ■ > Hvpvy 


IE 


= cr 2 G n (A^i,M), n = !,-■■ ,N U 


\ Then we have 


where G n (N\, M), n = 1, ■ ■ ■ , N\, is a function of N\ and M given by 

N x 


gi(w i ,m) = x; ("')(-i)»A 

p =1 ' V ' 


'Pi 


G n+1 (N 1 ,M) = G n (N 1 ,M)-A n , n = 1, ■ ■ • , N x - 1, 


(55) 

(56) 

(57) 


with 


k 0 +-k M -i=P 

/M—l 


(M -1 


^ = „Uu-- ( m! )*« 


x 


An = 


'JVt 


n 


y" y mk n 
\m=0 
n IVi-t 

EE 


pl+E^m^’ 


n\ AYj - l 


P 


(- 1 ) 


n— l+p 


Cp. 


(58) 


(59) 


Z=0 p= 0 

Note that in ((58]), the summation is taken over all sequences of non-negative integer indices k$ to Lm-i with 

P 


the sum equal to p, and the coefficients 
computed as 


ko, ■ ■ ■ , kM-i 


P 


are known as multinomial coefficients, which can be 


pi 


■ ■ ■ , kM-i) kol---kM~i ] - 

Furthermore, for n = 1 and M = 1, G n (N\. M ) can be expressed as the A'| th partial sum of the harmonic series, 
i.e., 


Nl 1 

Gi(W,,1) = E 7 


(60) 


i =1 


Proof: Define the random variables V n = ||v n || 2 , n = 1, • • ■ , N\ . It then follows that V], ■ ■ ■ , Uv, are i.i.d. 

Erlang distributed with shape parameter M and rate A = 1/cr 2 , whose CDF is given by 

M—1 

F(v) = 1 - V — -e~ Xv (Xv) m , v > 0. (61) 

' ml 


m =0 


The CDF F^(v) of the order statistics Vj n ] = |v1 | 2 is then obtained by substituting F(v) in ( fhTj ) into ( |54| ). The 
expectations can then be expressed as 

/*oo 

E [%] = / (! - ^[ n ]0)) dv , n = 1, • • • , Nl. 


(62) 
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We first show ( [55] ) for n = 1, i.e., the expectation of the maximum of N\ i.i.d. Erlang random variables. By 
substituting n = 1 into (541, we have i ? [i](n) = F(v) Nl . With binomial expansion, we then have 


M—l 


E [vinl = L 1 1 - 1 - E 

\ m =0 

E(p‘) ( - 1)P+1 “- 

p =1 ' ^ ' 


Ni ' 


dv 


(63) 


where 


n oo / jl ^'t 1 , 

L We^o 

1/11 \m=0 


E 


k 0 +---k M -i=P 

-'M — l 


P 


\koi i kM-iJ \ „ (m!) 

=r> \ m=u x / 


dv 

/M—l 


n 


X 


A Er=o^ 


m =0 

M-l „ 




1 


~ \° P ~ a "° 


v c P> 


(64) 

(65) 


where (64) follows from the multinomial expansion theorem, and ( [65] ) follows from the integral identity J 0 °° x n e px dx 
[32]. The result in ( [53] ) for n = 1 then follows by substituting ([65]) into 


To show ( [55] ) for n > 2, we apply the following recursive relationship obtained from ( p4[ ): 

F [n+1] (v) = F [n] (v)+^F(v) N ^ n (1 - F(v)) n , 
n = 1, ■ ■ ■ , Ni — 1. 

By substituting ( |66[ ) into < |62] >. we get 

E [V[ n+1] ] = E [V [n] ] - J™ (^Fiv)*-” (1 - F(v)) n dv. 


( 66 ) 


(67) 


By evaluating the integration in ( [67] ), the result in (551 for n > 2 can be obtained. 

Furthermore, by substituting M = 1 into ( |56| ) and with some manipulations, the expression for C\ (N \, 1) as 
given in ( |60| ) can be obtained. The property of G n (Ni, M) given in ( fT3| ) can be verified as well. 

This completes the proof of Lemma [9] ■ 

Appendix C 
Proof of Lemma m 

Note that in the absence of phase-I training {E\ = 0), the distribution of lp n simply follows from ([!]), which 

2 


thus leads to E 


hr, 


= /3M, Vn. In this case, no frequency-diversity gain is achieved. For the general scenario 
with Ei > 0, [n] is the nth strongest sub-band out of the N\ trained sub-bands as determined via ( fT2| ). As a 
consequence, the statistics of the channel hr n i are affected by phase-I training via ( fTT| ) and ( |75| ). To exploit such a 
relationship, we first show the following lemma. 
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Lemma 10: The input-output relationship in ( fiT[ ) is statistically equivalent to 

PVEl 


h,. — 


-y« + 




-t n , n G Mi, 


13Ei + Nq y pEi + N 0 

where t n ~ CAf(0 ,1 m) is a CSCG random vector independent of y l n , i.e., E [y^t^] =0, n <E M- 
Proof: It follows from ([T]) and ( [IT] ) that yj ( is a CSCG random vector distributed as 

yh ~ CAf (0, ((3Ei + Nq)1m) , Vn. 

Furthermore, the cross-correlation between h n and yj, is 

E [yj.hf] = 


( 68 ) 


(69) 


(70) 


Since both h„ and y n are zero-mean CSCG random vectors, to prove Lemma 10 it is sufficient to show that 
the random vector h n obtained by ( |68] ) has the same distribution as 0. and also has the same cross-correlation 
with yjj as ( [70] ). Such results can be easily verified from (68 > and ( |69| ). ■ 

By applying Lemma 10 for the N 2 selected sub-bands as determined in (fl2j), we obtain the following result, 


E 


hr. 


P 2 E 1 


(PEi + N 0 y 


E 


y[„ 


+ 


p 2 EiG n (N 1 ,M) + pN 0 M 


f3N 0 M 

Wi + No 

n = 1, • • • i N 2 , 


(71) 

(72) 


[3Ei + N 0 

where ( |72| ) follows from Lemma |9] in Appendix [B] and ( |69l ). 

This completes the proof of Lemma [T] 

Appendix D 
Proof of Lemma [2] 

Since both hj n ] and y|' ( - are zero-mean random vectors with i.i.d. entries, the LMMSE estimator can be expressed 
as h[ n ] = b n yj^j, with b n a complex-valued parameter to be determined. The corresponding MSE can be expressed 
as 


e n =E 


= E 
2 


1 b n y/£'2,nj B[n] ^n z [n] 

1 — b n y/E2,n Rn(N 1 , El) + \b n \ 2 NqM 
= \bn\ 2 (E 2 ,nRn{Ni,Ei) + N 0 M) - (b n + b* n ) ^E^R n (Ni, El) 

+ Rn(Ni, El). 

By setting the derivative of e n with respect to b* n equal to zero, the optimal coefficient b n can be obtained as 

, _ \/E2, n Rn(Nl, Ei) 

n ~ E 2l nRn(Ni, El) + NqM’ 


n = !,-■■ , N 2 . 
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The resulting MMSE can be obtained accordingly. Furthermore, the following result can be obtained, 

E2,nRn{Ni, E\) 


IE 


= K\'E ||yf B] 


.II 112 


E 2 ^ n R n {N 1 ,E l ) + N 0 M' 
n = 1, • • • • N 2 . 


(73) 


To show that E 


K] h w 


= 0, we use the following result, 


E 




= ME 


h H V n 
^[n] y n* 


E 2 Rl(N 1 ,E 1 ) 


E 2 Rh{N\, E i) + NqM 


Therefore, we have 


E 




= E 




= E 


-E 


= 0, Vn, 


(74) 


(75) 


where we have used the identity h[ n ] = h[ n ] — h[ n j. 

This completes the proof of Lemma [2] 

Appendix E 
Proof of Lemma [4] 

By applying ( |46| ) to ( fl4| ), we have R l n ge ~ M (N\, E\) —> f3M, Vn = 1, • • • , N 2 . As a result, the average harvested 
energy at the ER in ( |2T| ) reduces to 


n =1 ’ 


(76) 


which is independent of E\ and N\. Therefore, we should have 


ilarge-M 


0, i.e., the phase-I training for exploiting 


the frequency-diversity gain is no longer required. In this case, N 2 out of the N total available sub-bands can be 


arbitrarily selected for phase-II training in order to apply energy beamforming. Based on ( |24[ ). the optimal training 
energy for each of the N 2 sub-bands reduces to 


E 


ilarge-M 

2,n 


y/rjTP a (M - 1)N 0 - y 
y/r]TP s N 0 M, n = 1, • • • ,N 2 , as M — * oo. 


Furthermore, based on ( |36| ), the corresponding net harvested energy reduces to 

Q'Zr M -> N 2 (rfTPsPM - 2yJr,TP a N Q (M - 1) + 
-A rjTN 2 P s /3M, as M -A oo. 


This completes the proof of Lemma [4] 
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Appendix F 
Proof of Lemma [6] 

To prove Lemma [ 6 ] we first show the following result. 

Lemma 11: For the function G n (Ni,M) defined in Lemma [9] in Appendix | b| the following inequality holds: 

G n (N 1 ,M)<MG 1 (N 1 ,l),Vn = !,-■■ ,N 2 . (77) 


Proof: Since G n {N\, M) monotonically decreases with n as given in ( fl5| ). to prove Lemma 11 it is sufficient 
to show that Gi(N\, M) < To this end, we define N\ i.i.d. zero-mean CSCG random vectors 


Uj ~ CA7(0, Ijvf), * = !,••• , N\. Then based on the definition of G n (N\,M ) given in <55 1 , we have 





M 

G\(Ni,M) =E 

max || lx* 2 

= E 

max > \ua\ 2 

i=l, -Ni J 

i =1 



" M 

M 


<E 

max \ua\ 2 
^i=1,-Ni 31 

= E E 

max \ujn\ 2 
i=l,-Nf 


2=1 

7=1 

L J 


=MG 1 (N U 1), 


where Uij ~ CA7(0,1) denotes the jth element of u,.. Note that the last equality follows from the definition of 
G\(N, 1 ) and the fact that for any fixed j, {u tJ , are i.i.d. zero-mean unit-variance CSCG random variables. ■ 

With the expression in ([37]), the average harvested energy with perfect CSI at the ET is upper-bounded as 

n 2 

Q ideal (AT) = rjTPf 3 G n (N, M) < riTP s f3N 2 MG\(N, 1), (78) 

n= 1 


where the inequality follows from Lemma 11 


Furthermore, it is easy to see that Q ldeal (A r ) 


is lower-bounded as 


Q ldeal (A0 > riTP s (3Gi(N, M) > rjTP^G^N, 1), 


(79) 


where the last inequality follows since G\(N,M) monotonically increases with M. It follows from ( [78] ) and ( [79] ) 
that Q' deal (A r ) should have the same asymptotic scaling with N as G\ (N. 1). Furthermore, for sufficiently large N, 


it follows from (60) that Gi(N, 1) can be expressed as 


N I 

Gi(N, 1) = - = In Af + 7 + e N , 


( 80 ) 


i =1 

where 7 ~ 0.5772 is the Euler-Mascheroni constant and e ~ ^ which approaches 0 as N increases. Therefore, 
for A -) 00, the following inequalities hold, 


r)TP s fl In N < Q ide ' A (N) < r t TP s fN 2 M In N, 


( 81 ) 


or Q ldcal (A r ) = 0(ln A r ). This thus completes the proof of Lemma [ 6 ] 
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Appendix G 
Proof of Lemma [Tj 

To prove Lemma |7j we first derive an upper bound for the net harvested energy Q m t(Ni . E \, {E 2n }) of the 


two-phase training scheme for arbitrary N±, E\, and { E 2 , n \. Based on (22), we have 


No 


Qmt{Ni,Ei, {E 2n }) < r]TP s MNuEi) - E X N X 


n= 1 


N 2 




n =1 


PE X + N 0 


<,TP s pN 2 M ^ G ^ + N ^ E lNl 


PE 1 + No 


— Qnet(Nl,Ei), 


where the last inequality follows from Lemma 11 in Appendix [F] Note that the above bound is tight for M = 1 
and N 2 = 1. As a result, the net harvested energy achieved by the optimized training scheme is upper-bounded by 
the optimal value of the following problem. 

/n1 TT1 ^ rpu q A/r PEiG\(Ni, 1 ) + No 

(Pl-UB): max r{T P s fiN 2 M - — - E X N\ 

N U E ± pE\ + TVo 

subject to N 2 < N\ < N, 

Ei > 0 . 

With N\ fixed, (Pl-UB) is a convex optimization problem with respect to E\, whose optimal solution can be 
expressed as 


E? a *{Ni) = 


r]TP s N 2 MNq {G\ (JVi , 1) - 1) No 

P 


Ni 


(82) 


The corresponding optimal value of (Pl-UB) as a function of N\ can be expressed as 

r]TP s N 2 MP, if G 1 {Ni , 1) - 1 < A 

Qn U et(^l) = ' 


r,TP s N 2 Mp ^1 + (y/ G±(Ni, 1) — 1 - ^JrWPvf) J , otherwise. 
Therefore, problem (Pl-UB) reduces to finding the optimal number of training sub-bands as 

i). 


(83) 


max 

N 2 <Ni<N 


(84) 


It follows from ( f83j ) that if T is small such that Gi (N\. 1) — 1 < T nIm ’ VNi, Q™ is independent of N± and hence 
problem (|84|) has the trivial optimal value rjTP s N 2 Mp, which is smaller than the right hand side (RHS) of (53). 


On the other hand, for the non-trivial scenario with moderately large T values, problem ( |84| ) reduces to 

max yj Gi(Ni, 1) - 1 - J - 

N 2 <Ni <n v V 1 


Ni 


tn 2 m 


( 85 ) 
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Problem (851 is a discrete optimization problem due to the integer constraint on N\. To obtain an approximate 


closed-form solution to (85), we first apply the following approximation for the harmonic sum. 


iVi 


G\ ( Ni , 1) = - = In iVi + 7 + ejvj « In fVi + 1. 


i —1 


Furthermore, by relaxing the integer constraint and with asymptotically large N and moderate values, problem 

Ni 


( 851 can be approximated as 


max 


( 86 ) 


By setting the derivative to zero, the optimal solution A r * to problem ( [86] ) satisfies 

N* In N* = TN 2 M , or lriA^e 111 ^’ = TN 2 M. (87) 

Based on the definition of the Lambert W function, N * in ( [87] ) can be expressed as iV* = e w ( riV2M ), and the 
corresponding optimal value of problem ([84]) can be obtained as 


Q 


UB* 


"Tn^m{ 1 +(vwmm - 7W ^ m) 


( 88 ) 


This thus completes the proof of Lemma [7] 
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